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Abstract 

The hypothetical nonlocal box (NLB) proposed by Popescu and Rohrlich allows two spatially sepa- 
rated parties, Alice and Bob, to exhibit stronger than quantum correlations. If the generated correlations 
are weak, they can sometimes be distilled into a stronger correlation by repeated applications of the NLB. 
Motivated by the limited distillability of NLBs, we initiate here a study of the distillation of correlations 
for nonlocal boxes that output quantum states rather than classical bits (qNLBs). We propose a new pro- 
tocol for distillation and show that it asymptotically distills a class of correlated quantum nonlocal boxes 
to the value |(3-\/3 + 1) sa 3.098076, whereas in contrast, the optimal non-adaptive parity protocol for 
classical nonlocal boxes asymptotically distills only to the value 3.0. We show that our protocol is an 
optimal non-adaptive protocol for 1, 2 and 3 qNLB copies by constructing a matching dual solution for 
the associated primal semidefinite program (SDP). We conclude that qNLBs are a stronger resource for 
nonlocality than NLBs. The main premise that develops from this conclusion is that the NLB model is 
not the strongest resource to investigate the fundamental principles that limit quantum nonlocality. As 
such, our work provides strong motivation to reconsider the status quo of the principles that are known 
to limit nonlocal correlations under the framework of qNLBs rather than NLBs. 

1 Nonlocality distillation 

Consider two parties, Alice and Bob, spatially separated and isolated, interested in jointly computing some 
boolean function /(•,•)■ A third party, David, provides Alice with an input x (unbeknown to Bob) and 
Bob with an input y (unbeknown to Alice) and challenges them to compute the bit f(x, y). David allows 
Alice and Bob to communicate, but charges for each and every bit communicated between them. Alice and 
Bob therefore pre-agree upon a protocol that minimizes the amount of communication required for them to 
compute the bit f(x, y). This is what we know as communication complexity |[25l . 

It seems entirely impossible to jointly compute a non-trivial function if no information can be interchanged 
between Alice and Bob, and it is indeed one of the first results typically shown in any introduction to 
communication complexity. But as soon as one tweaks the models ever so slightly, surprising results are 
possible. The nonlocal box is one such tweaking. 

A nonlocal box (NLB) is a device shared between two parties that, in itself is incapable of transferring any 
information from Alice to Bob, or vice- versa. A nonlocal box takes two bits as input, a bit x from Alice and 
a bit y from Bob, and outputs two bits, a bit a provided to Alice (and only Alice) and a bit b provided to Bob 
(and only Bob). If the two input bits x and y from Alice and Bob equal (0, 0), (0, 1), or (1, 0), the box (by 
definition) provides Alice and Bob with identical bits. That is, either both of them receive or both of them 
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Figure 1: A nonlocal box. The figures depict the NLB as a single box rather than two invidual boxes shared 
between Alice and Bob. 



receive 1, each case happening with probability i. If the two parties both give the box a 1 as input, the box 
provides Alice and Bob with opposite bits x and y, again each of the two cases 01 and 10 happening with 
equal probabilities |. See Figure[l] 

The correlations related to the NLB and some of their key properties were initially discovered by Khalfm 
and Csirel'son [24 J in 1985. Reintroduced by Popescu and Rohrlich in their seminal 1994 paper OT1 . NLBs, 
have since undergone extensive scrutiny. A nonlocal box is by definition non-signalling. The marginal of 
the bit a received by Alice is uniform irrespective of whether Bob inputs or 1 to the box, and she thus 
does not obtain any information about Bob's input. Yet the parity of the two output bits a and b is perfectly 
correlated with the logical AND of the two input bits x and y. 

The perfect nonlocal box (as defined above) is powerful enough to render all of communication complexity 
trivial, i.e., any boolean function may be computed by a single bit of communication between Alice and 
Bob [ 16]. Even if we modify the box so that, for each of the four possible inputs, it provides an output of 
the expected parity only with probability at least 3+ 6 v ^ « 0.908, it would still be possible to compute any 
boolean function with bounded error using only a single bit of communication Q ! 

Our motivation for the current work develops from a simple open question, i.e., do noisy NLBs within the 
range cos 2 ( | ) < p < 3+ 6 allow for trivial communication complexity? The agenda in this approach is 
to show that quantum mechanics restricts correlation sources that result in a world in which surprisingly 
powerful information processing procedures could be performed. A different example of this line of work 
from cryptography is due to Buhrman et al. ifTUll . which shows that NLBs can be used to be perform any 
two-party secure computation. They build protocols for bit commitment and oblivious transfer using NLBs, 
both of which are known to be impossible to achieve using quantum mechanics. 

Nonlocality distillation refers to the extent by which we can turn weak nonlocal boxes into more pure 
nonlocal boxes through a protocol. The idea is to consider whether it is possible for the players to concentrate 
the nonlocality in n copies of an imperfect nonlocal source to form a stronger nonlocal correlation source. 
In this sense it may be considered similar to entanglement distillation. We have gained some understanding 
of when nonlocality can be distilled l[T7l[T^l20l l9ll2l[Tl l23l[T9l . when it cannot ll33ll and when it appears 
in bound form [8]. In general, the results suggest that distillation is only possible under special favorable 
circumstances and that large classes of nonlocal boxes are not distillable. 

The apparent limited distillability of N LBs even under adaptive protocols seems to suggest that distillation 
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Figure 2: A possible physical realization of the quantum nonlocal box. David who wants to compute a 
boolean function f(x,y) transmits the description of / as well as x and y to Alice and Bob respectively. 
Alice and Bob share the state \xy) with Charlie, who implements and distributes the results of the no- 
signalling map Q{\xy)). Finally, Alice and Bob send a and b to David, such that f(x, y) = a®b. 

may not be a strong enough framework to draw conclusions regarding limits on nonlocal correlations. With 
the introduction of the qNLB model, we hope to change the situation. Historically, it took a period of 
more than half a century to realize that a more feasible interpretation of Bell inequality violations is to view 
them as a resource for processing information, rather than as paradoxes. Apparently, the same restrictive 
reasoning haunts us where we view the violation of Csirelson's inequality as something to be written off 
as an impossibility. No doubt, it is crucial to determine the principles that determine bounds on quantum 
correlations, however, another approach is to construct communication models that produce exactly the 
correlations within the no-signalling polytope. 

We approach stronger that quantum correlations with this new perspective. Rather than considering a hy- 
pothetical box resource, the spatially separated parties Alice and Bob, are now provided access to a trusted 
third party Charlie. Charlie is allowed to communicate with Alice and Bob without allowing communication 
between Alice and Bob. Consider the scenario depicted in Figure [2] David who wants to compute a boolean 
function f(x, y), provides Alice and Bob with a description of / and the partitioned inputs x and y. Alice 
and Bob may now use another trusted party Charlie who simulates the actions of a NLB/qNLB. This allows 
Alice and Bob to determine and transmit a and b to David such that f(x, y) = a © b. The three parties are 
able to help in computing the function / without any of them having access to complete information. Alice 
and Bob know the function, but not the complete input, nor its value, while Charlie knows the input without 
knowing the function being computed. 

Charlie's actions can be modelled by a nonlocal box that produces correlated physical systems as output. 
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A quantum nonlocal box, abbreviated qNLB, takes as input a joint quantum state and outputs a joint quantum 
state. A priori, such a model may not obey our non- signalling requirement since any unitary Dab not on 
the form <g> U# allows for signalling (6j|30l. It thus may appear that a quantum generalization of the 
NLB model would always allow for signalling, but this only holds true if we restrict the maps to be unitary. 
Quantum nonlocal boxes that satisfy the non- signalling requirement and allow for quantum states as output 
are possible when we drop the requirement of the box being unitary. Such boxes have previously been 
studied under the notion of causal maps, completely positive trace-preserving maps, and non-signalling 
operations [30l 13 EH [TTH . 

As our main result, we show that qNLBs exhibit strictly stronger nonlocality distillation than NLBs when 
restricted to non-adaptive distillation protocols. We show that in such a scenario, the optimal non-adaptive 
nonlocality distillation protocol for Alice and Bob asymptotically performs better than the optimal non- 
adaptive distillation parity protocol for NLBs ||23l . 

Theorem 1 Quantum nonlocal boxes exhibit stronger nonlocality distillation for non-adaptive protocols 
than the optimal non-adaptive parity protocol for classical nonlocal boxes. 

We prove our main theorem by setting up a semidefmite programming framework [ 34 ] for analyzing non- 
adaptive protocols for qNLB distillation. We then use this framework to define and give a protocol for qNLB 
distillation and show that it outperforms the optimal non-adaptive protocol for classical nonlocal boxes lT23l . 
We show that our protocol is an optimal non-adaptive protocol for the class of correlated qNLBs, given 1, 2 
and 3 copies by constructing a dual solution that attains the same value as the primal. 

2 Distillation protocols 

We define the value of a nonlocal box as the sum of the biases that the parity of the box agrees with the 
logical and of the input bits, over all four possible inputs, 

V = S2 Pr[a0 b = x ■ y] - V] Pr[a®b^x-y]. (1) 

x,ye{0,l} x,ye{0,l} 

Brunner and Skrzypczyk considered and analyzed in [9] a class of NLBs that has only one-sided errors and 
labelled them correlated NLBs. 

Definition 1 A correlated NLB maps the three inputs 00, 01 and 10 to the output 00 with probability ^, and 
to the output 11 with complementary probability |. It maps the input 11 to either of the two outputs 01 and 
10 with equal probabilities |, and to either of the two outputs 00 and 11 with equal probabilities = |. 
Here p G [0, 1] denotes the probability that, on input 11, the output of the NLB is of odd parity. Similarly, 
q = 1 — p denotes denotes the probability that, on input 11, the output of the NLB is of even parity. 

The value of a correlated box is 3 + p — (1 — p) = 2(1 + p), and the value of a perfect NLB is 4. 

Consider now that Alice and Bob share n instances of a correlated nonlocal box, all with the same param- 
eter p. Their goal is to simulate the behaviour of a correlated nonlocal box with a better parameter p' > p 
by using some pre-agreed upon protocol. They may use the n NLB instances as well as shared randomness, 
but are not allowed to communicate. If their protocol achieves a higher value p' than p, we call the protocol 
a distillation protocol. A distillation protocol using n nonlocal boxes is said to be non-adaptive if Alice is 
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required to provide her input x to all n nonlocal boxes and Bob is required to provide his input y to all n 
boxes. 

A correlated NLB can be asymptotically distilled to a perfect NLB by an adaptive protocol GDI as follows. 
Consider a single execution of a correlated NLB with input bits x and y and output bits a and b. If the two 
inputs are both 1, a correlated NLB may output an incorrect correlation, whereas, if at least one of the two 
inputs is 0, the output is always correctly of even parity. Viewed from the perspective of the output bits a 
and b, if the parity a © b is odd, we can conclude that the two inputs bit were both 1, and that the output 
therefore is correct. Only if the output a © b is even, can we not conclude with certainty that the output 
is correct. An adaptive protocol can use this one-sidedness of error to distill to the asymptotically optimal 
value of 4 by patiently waiting till the first time a usage of the correlated NLB yields an output of odd 
parity. This can be detected distributively (but not locally), and once detected, the protocol adaptively (and 
distributively) adjusts further usages of the NLBs so that all future outputs are of even parity. The eventual 
distributive detection of an output of odd parity reveals that the input bits were both 1, and the lack of an 
output pair of odd parity indicates that at least one of the two input bits were 0. An odd parity output will 
eventually occur, allowing us to asymptotically distill to the optimal value 4. 

A non-adaptive protocol can in contrast not distill to the value 4. By not allowing for adaptiveness, the 
distributive detection of the parity of the output can not be fed back into the system, and the protocol then 
fails in taking full advantage of the knowledge it possesses. A non-adaptive protocol must patiently wait till 
all outputs are produced, at which stage its best strategy is to take the parity of a certain number k of its 
outputs 11231 . 

Theorem 2 ([23 1) The value attainable by any non-adaptive protocol using at most n correlated NLBs is 
upper bounded by 

v= {3-(q- P ) n ifO<P<± 
[2(1 +p) if^p^l, 

and this value is attainable by the parity protocol. 

In this work, we consider the case that the NLBs take quantum states as input and produce quantum states 
as output. See Figure [3] 

Definition 2 (Quantum nonlocal box) A quantum nonlocal box (qNLB) Q takes as input a product state 
\ipxy) £ { 1 00) , 1 01) , 1 10), |11)} 6 %a © %B an d outputs a state p xy G Ha © He such that for every map 
Ta : Ha ^ Ha and Tb : He > T~£b the following two no-signalling conditions hold, 

Tr A Q( (T A © 1) \ip xy ) (tp xy \) = Tr AQ\il) xy )(i>xy\ 
Tr B Q( (1 © r B ) \^xy)(^xy\) = Tr B Q|V^)(V^I- 

In particular, we consider the class of correlated qNLB that generalizes the class of correlated NLBs. 

Definition 3 A correlated qNLB maps the three inputs 1 00), |01) and | 10) to the pure state = ©^ ( 1 00) + 
|11)), and maps the input \11) to the mixed state p = p\<j>)((j)\ + q\ip) (ifj\, where \(j>) = ©^ (| 01) + 1 10) ) is 
a superposition over the two odd-parity states, p £ [0, 1] a probability, and q = 1 — p the complementary 
probability. 
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Figure 3: A quantum nonlocal box 
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Figure 4: A non-adaptive distillation protocol for qNLBs. 



Given that Alice and Bob share n copies of a correlated qNLB and measure observables A x and with 
eigenvalues ±1 for input bits x and y, respectively, the value attained for the CHSH inequality lfT3l is 



V 



% n 



(A <8> B + A ® Bi + Ai ® Bo)|^)^ n - Tr(Ai ® Bip 1 



(3) 



A qNLB is at least as powerful as an NLB: For any value of p, Alice and Bob can use a correlated qNLB to 
simulate the correlation of a correlated NLB by simply measuring each of their outputs in the computational 
basis. In this paper, we formally prove that qNLBs are strictly more powerful in extracting nonlocality than 
are NLBs. See Figure [4] for the structure of non-adaptive distillation protocol for qNLBs. We establish 
our main Theorem [T] by giving an explicit non-adaptive protocol that attains a higher distilled value for 
correlated qNLBs than the optimal parity protocol attains for correlated NLBs. The amount of distillability 
achievable by non-adaptive protocols for NLBs is characterized in [23], here specialized to correlated NLBs 
as Theorem |2] above. 

We summarize the known results on non-adaptive distillation of classical and quantum correlated nonlocal 
boxes in Tables [I] and [2] For NLBs, we have complete knowledge: correlated NLBs are non-adaptively 
distillable if and only if < p < |» an d the parity protocol of Forster et al. |[T9l is an optimal non- 
adaptive protocol [23]. For qNLBs, we show here that correlated qNLBs are non-adaptively distillable 



6 





NLB distill? 


Value 


p = 


no 


2 


< p < \ 


yes 


3-(q-p) n 


l^p^l 


no 


2(1 +p) 



Table 1: Non-adaptive distillation of correlated NLBs is possible if and only if < p < \, for which they 
can be asymptotically distilled to the value 3. 





qNLB distill? 


Value 


p = 


no 


2 


<p < \ 


yes 


(3 + (g-p)")cos(0) + l(l-(g-p)») 


v = \ 


no for n ^ 3 


i(3x/3 + l) 


\<v<\ 


no for n ^ 3 


3cos(0) — gcos(30) + p 


§<P<1 


no for n 3 


2(1 +p) 


p = 1 


no 


4 



Table 2: Non-adaptive distillation of correlated qNLBs is possible when < p < |, for which they can be 
asymptotically distilled to the value |(3>/3 + 1) « 3.098076. When i < p < 1, non-adaptive distillation 
is not possible using at most 3 qNLBs. Measurement angle <fi depends on p and is defined in Eq.[5] 

when < p < |, and that correlated qNLBs can not be non-adaptively distilled when | ^ p < 1 if we 
Alice and Bob are allowed to use at most 3 qNLBs. When i ^ p < 1, we show that the single-usage qNLB 
protocol of Piani et al. ll30l is optimal among all non-adaptive protocols using at most 3 qNLBs. 

The values attainable are plotted in Figure [5] When < p < |, qNLBs achieves a strictly larger value than 
NLBs for any fixed value of n. For < p < \, qNLBs can be asymptotically distilled to \(?>\[?> + 1) w 
3.098076, whereas NLBs can only be asymptotically distilled to the value 3. 



3 Our distillation protocol 

We propose the following protocol [P] for non-adaptively distilling correlated qNLBs. 

Protocol "P Let Alice and Bob share n identical copies of a correlated qNLB of parameter p and let them 
receive input bits x and y, respectively. Their observables A x and B y are given by 



A x = cos I ^ + x4>j Z + (-l) x sin I ^ + xcf>j X 
B, = cos +yAz- {-If sin (t + y A X. 
The operators Z and Xfor the observables A x and B y are chosen based on the value ofp, 



if < P < \ s v _ / ^" if < P < \ 
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Figure 5: Value attained by our Protocol V for correlated qNLBs (solid line) and the parity protocol for NLBs 
(dotted lines) [19]. For < p < \, parity distills to 3.0, while our protocol distills to \ (?>y/?> + l) « 3.098. 

For | ^ p < |, even though qNLBs attain a higher value than NLBs, no distillation occurs. Finally, for 
| ^ p ^ 1, both the protocols attain the same value without any distillation taking place. 



The measurement angle <ft depends on p and is chosen such that it maximizes the value attained for the 
CHSH inequality, 



cos" 



(<f>(p)) 



1 ( S+(q-p) n 
4 \l+(q-p) n 

1+2 

4? 

1 



i/O < p < \ 

if\^p<\ 



(5) 



The observables chosen by Alice and Bob in Eq.[4]in Protocol[p|depends on the probability p. If < p < |, 
Alice and Bob non-trivially use all n available qNLBs. They view those n qNLBs as a single qNLB and each 
applies an observable given by global measurement angle in a two dimensional space spanned by the two 
observables af n and a® n (see Eq. [4]). Thus viewed as a two-dimensional rotation, our chosen observables 
can be seen as a generalization of the measurements in the protocol of Piani et al. [30] for a single qNLB. 



If i 

11 2 



^ p ^ 1, Alice and Bob effectively choose to use only a single qNLB by applying the identity 
observable 1 = ( J \ ) on all but the first qNLB. The output bits a and b of Alice and Bob depend only on 
the output bits of the first qNLB. Therefore, an alternative protocol achiving the same values as ours, can be 
constructed in which Alice and Bob first choose to use a number k of qNLBs, discard the remaining n — k 
qNLBs, and then each apply an observable on the k selected qNLBs as in Protocol [p| When < p < \, 
they pick k = n, and when p ^ i, they pick k = 1. 



Having specified the four observables Ao, Ai, Bo, and Bi, we compute the value attained by our Protocol fP\ 
by plugging into Eq. [3] 

Lemma 3 Protocol\P\attains the value 



V 



(3 + (g-p) n )cos(0) + ^(l-(g 
2(1 + q) cos(0) +p 
[2(1 +p) 



if0<p<l 



(6) 



A complete proof of Lemma|3]is given in Appendix|A] In the next two sections, we prove that our protocol 
is an optimal non-adaptive protocol for any < p ^ 1 and any n ^ 3. 
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4 Protocol [P] is optimal for a single copy 



We now show that no other protocol can achive a higher value V than ours when Alice and Bob are given a 
single qNLB. When n = 1, the expression for the value V given in Eq. [3] simplifies to 

MAo ® B |^) + <V|Ao ® Bi|V) + (V>|Ai ® B |V>> - p(0|Ai ® Bi|0) - g(^|Ai (8) Bj|^). 

Using that the two Bell states \ip) and |</>) (given in Definition [5]) can be locally mapped to each other, 
= (1 a x )\(j)}, we rewrite the optimization problem in terms of a single state 

V = (i/>|A ® B + A Bi + Ai ® B -p(Ai 0cr x Bicj x ) - g(Ai ® Bi)|V>, (7) 
allowing us to apply Csirelson's conversion between observables and vectors fT4l[T5ll . as done in Wehner Il34l . 



Lemma 4 (Tsirelson H14U15H ) Let Ao, . . . , A m _i a«c? Bo, ... , B m _i fte observables with eigenvalues in 
the interval [—1, 1]. Then for any state shared between Alice and Bob, there exist real unit vectors 
xq, . . . , x m -i and yo, . . . , y m -i such that 

(^|A;®Bj|^) = Xi -yj (8) 

for all ^ i, j < m. Conversely, for any set of real unit vectors, xq, . . . , x m -\ and yo, . . . , y m -i, and any 
maximally entangled state \ip), there exist observables Aj and Bj with eigenvalues ±1 such that Eq. ^holds 
for all ^ i, j < m. 

We define five vectors, one vector for each of Alice's two observables Ao and Ai, one for Bob's observable 
Bo, and two vectors for Bob's observable Bi, 

x = (A <g)l) |<i/>) yo = (1<8B )|V>) z = (1® Bx)!^) 

Xl = (Ai®l)|^) 21 = (1® (OxBiOx))!^). 

Let G = [py] be the Gram Matrix of the five vectors {xq, x%, yo, zq, Zi}, 
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and set W to be the weight matrix 
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Figure 6: Geometric intuition for the choice of angle <j>, when < p ^ |. The choice determines the 
direction of measurements performed by Alice and Bob and how the vectors we obtain through Csirelson's 
vectorization relate to each other. For the single copy case z eve n and z 0( jd correspond to the individual vectors 
zq and z\ respectively. In the multiple copy case z even and z Q dd correspong to multiple vectors. 



Finding an upper bound on the value V in Eq.|7]then becomes equivalent to finding an upper bound on the 
primal value of the semidefinite program (SDP) 

max -Tr(GW) 

G 2. 

subject to G fc= ^ 
gu = lforalli G {1,...,5}. 

The constraint G )?= ensures that G is a Gram matrix, and the constraints that the diagonal entries of G are 
equal to 1 , ensure that the five vectors are of unit norm. From any valid solution to the primal, we can extract 
a set of five observables via Tsirelson's correspondence and construct a protocol that has the same value as 
the primal solution, and vice-versa, from any protocol, we can extract a set of five vectors, the Gram matrix 
of which is a primal solution having the same value as the value attained by the protocol. 

We prove our upper bound on the primal value by giving a feasible solution to the dual of value equal to the 
value V in Eq. [6] To conclude that our dual solution is feasible, we need to show that a particular matrix 
M. is positive semidefinite. Rather than attempting conveying a technical analysis of the roots of the matrix 
M's characteristic polynomial, we shall instead break the matrix M. into smaller parts and repeatedly apply 
the following simple observation about the eigenvalues of a matrix of dimension 2x2. 

Observation 5 A real-valued 2x2 matrix is positive semidefinite if and only if it has a non-negative diag- 
onal entry and its determinant is non-negative. 

To see this, notice that a symmetric real-valued matrix is positive semidefinite if and only if one of its two 
eigenvalues is non-negative and the product of its two eigenvalues is non-negative, which holds if and only 
if it has a non-negative diagonal entry and its determinant is non-negative. 

Lemma 6 The dual value of the SDP given in Eq. ^is upper bounded by the value attained by ProtocolfP] 
for n = 1 given in Eq. [6] 
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Proof Let b = (1, 1, 1, 1, 1) be a vector in R 5 . The dual of the primal SDP in Eq.[9]is 



min A • b 

A 



subject to K = 2 diag(A) - W fc= 0, 



(10) 



where A is a vector in R 5 and matrix diag(A) is of dimension 5x5 containing A» in the i th diagonal entry 
and zeroes off-diagonal. 

First consider the range 1 ^ p ^ 1. The dual solution 



P V\ 



A = (l, Pj l,|,|) 

has value A • b T = 2(1 + p), matching the value of the protocol given in Eq. [6] To show that the constraint 
K !>= for the dual problem is satisfied, express matrix K = 2 diag(A) — W as the sum of two matrices, 
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K = K 1 +K 2 



Matrix K 2 is a scaled projection with eigenvalues and 2p and is therefore positive semidefinite. For 
matrix K\, ignore its fifth row and column, which are zero, and conjugate the remaining 4x4 submatrix 
of K x by -j= ( } _} ) <g) 1, yielding the submatrix 
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-1 
o 





-1 
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2p-l J 



The upper- left 2x2 block is positive semidefinite, and, by Observation[5} the lower-right block is positive semidefinite 
when 3(2p — 1) ^ 1, which holds when p ^ |. We have shown that matrix K is the sum of two positive semidefinite 
matrices, and it is therefore positive semidefinite. 

Next consider the range < p < | . The dual solution 

A = coB(^)(l,g,l,<z J 0) + (0,| J 0,0,|) 

has value 2(1 + q) cosid) + p, matching the value of the protocol given in Eq. [6] for n = 1. (When n = 1, 
the expression in Eq. ppor the range < p < | simplifies to the expression for the range g ^ p < |.) It 
remains to show that the constraint K > is satisfied. Proceeding as in the case I ^ p ^ 1, we write 



K = K l + K 2 
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and conjugate the upper-left 4x4 submatrix of K\ by ^ ( \ _\ ) (g) 1, this time yielding the block matrix 

/ 2cos(<£)-l -1 \ 

-1 q{2 cos(» + 1) 

2cos(» + l 1 

V 1 g(2cos(» - 1) / 

The two diagonal entries 2 cos(0) + 1 and q{2 cos(<p) + 1) are non-negative since both cos(0) and q are 
non-negative. Both blocks have the same determinant Aq cos 2 (0) — (1 +q) which equals zero. (When n = 1, 
the expression in Eq.pl for the range < p < \ simplifies to the expression for the range \ ^ P < §•) 
Applying Observation]?} we conclude that K is positive semidefinite. □ 

We have proved that the dual SDP is feasible and has a solution of value no larger than the value attained 
by the protocol. By Tsirelson's correspondence, the protocol yields a feasible solution to the primal SDP of 
the same value as the protocol. These three values must therefore be equal. We conclude that our protocol 
is optimal for n = 1 and that the measurement angle specified by Eq.[5]is optimal. 



5 Protocol V is optimal for 2 and 3 copies 



In the preceding section, we show that no protocol can achieve a value higher than our Protocol [7£] when 
given only a single copy of a qNLB. We show that the same statement holds true for 2 and 3 copies of a 
qNLB: Among all non-adaptive protocols for distillation using at most 3 copies of a qNLB, none attains a 
value strictly higher than the value attained by our Protocol [P] using the same number of copies of a qNLB. 

Theorem 7 ProtocolfP\is optimal among all non-adaptive protocols using at most 3 copies of a qNLB. 

The proofs of the cases with multiple copies follow the outline we use in the simple single-copy case, 
except that now some of the steps become significantly more involved. We first give a general construction 
of a primal SDP and its dual SDP for any number n of copies of a qNLB, stated as Eqs. 14 and 15 in 
Appendix |B] below. The size of the dual SDP grows exponentially in the number of copies n utilized in the 
qNLB protocol. In Appendix [Cjbelow, we give a complete analytical proof of its value when n is at most 3, 
proving Theorem [7] 

Our proof technique is general and should in principle be extendable to any fixed higher value of n. More 
desirable, however, is to discover a method for analyzing our dual SDP for all values of n simultaneously. 
It seems plausible that such a generic proof technique should exist, but finding one has thus far eluded us. 
The dual SDP has an appealing representation in which we have been able to maintain many symmetries 



and letting it have an almost algorithmic structure. We expect that the solution value obtained in Lemma 1 1 
can be generalized to all values of n. 

One possible route in proving a generalized version of Theorem [JJ that holds for all values of n is to obtain 
a general form for the off-diagonal entries of the matrix Wtaii- If a proof of optimality for any n could be 
found, it would imply that we could make a statement equally strong to the NLB case, thus proving that our 
ProtocolFPlis optimal among all non-adaptive protocols for qNLBs. 
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6 Discussion 



Our qNLB distillation protocol outperforms the optimal parity protocol for NLBs due to the different route it 
takes to achieve distillation. The classical protocol determines the final output bits by computing the parity 
of the individual output from each box. For correlated NLBs, this leaves the expectation value unchanged 
for inputs 00, 01 and 10, while for input 11 the expectation is increased for values of p less than half. 
Our qNLB protocol performs better than the classical optimal protocol by hedging. Non-adaptively, the 
players can tweak their measurements so as to set up stronger correlations. The combined reduction in 
the expectation values for inputs 00, 01 and 10 is overwhelmed by the increase in the expected value for 
input 11. Effectively, the entangled measurements on qNLBs allow access to a set of strategies that are 
inaccessible to classical protocols for nonlocal boxes. 






Genie 










2/1,2/2, •• • ,Vn 



Figure 7: Assisted common information. 

The framework we consider in Figure[2]is analogous to the idealised secure scenario for two-party computa- 
tion model considered by Yao ll35l . The trust assumptions for Charlie may be unrealistic for cryptographic 
protocols but the model highlights the notion that it is possible to consider NLB correlations as a physical 
model rather than only as a hypothetical resource |T0l . Similar models have recently been considered 
under the notion of assisted common information (Figure [7]) within the cryptography community [32]. The 
bounds obtained for nonlocality distillation may allow for improved cryptographic limits within an appro- 
priate error model. 

One approach for obtaining an understanding of limits on quantum correlations is to develop underlying 
principles that are expected to be true for physical theories. The principle of information causality proposed 
by Pawlowski et al. ll29l . is one such principle. It states that the transmission of n classical bits can cause an 
information gain of at most n bits. The principle, a generalization of the no-signalling conditions, is violated 
by all correlations that violate Csirelson's bound. It is not known however, whether it is violated by all 
nonlocal correlations that are prohibited by quantum mechanics. Other physically motivated considerations 
include local quantum measurements due to Barnum et al. [4 ] and the uncertainty principle put forward by 
Oppenheim and Wehner l28l . 

In our current work we show that if we restrict out attention to non-adaptive protocols, qNLBs offer im- 
proved distillation over NLBs. A generalization of our SDP approach for qNLBs may provide a similar 
result for adaptive protocols. This may imply distillability for correlations that are currently not known to 
be distillable and at the same time an increased understanding of correlations that violate principles such as 
information causality. Similarly, a new protocol for qNLBs may close or reduce the trivial communication 
complexity gap Q. At the moment we do not have good insight into the structure of adpative protocols for 
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qNLBs or even a formulation that makes their analysis accessible. 

The principle of macroscopic locality proposed by Navascues and Wunderlich ll27ll . states that a physical 
theory should recover classical physics in the macroscopic limit. In terms of nonlocal correlation this implies 
that as the number of particles with Alice and Bob become large, the joint distribution p a b\ xy should admit a 
classical description. The principle suggests that quantum correlations identify exactly the set of correlations 
that are local macroscopically. The principle characterizes a slightly larger set, since Cavalcanti et al. lfl2l 
showed that macroscopically local correlations can violate information causality. 

The principle of macroscopic locality may still identify exactly the set of quantum correlations, but require 
a stronger resource in form of a qNLB to do so. Even if it turns out that our claim is invalid, the qNLB 
model offers an alternate interpretation of macroscopic correlations that is not accessible with NLBs. Apart 
from our obvious conclusion that qNLBs offer stronger distillability than NLBs, it is possible to interpret 
the distinction between quantum and classical attainable values in Figure [5] as a separation, in principle, 
between quantum and classical predictions at the macroscopic level. The result corresponds to a form of 
Bell's inequality at the macroscopic level and implies a physical experimental framework within which 
this separation may be observed. Admittedly the result under consideration is restricted to non-adaptive 
distillation protocols and the Brunner and Skrzypczyk adaptive protocol is known to distill correlated NLBs 
asymptotically to a perfect NLB. Establishing optimal adaptive distillation protocols for qNLBs that take 
into account coarse graining in measurements at the macroscopic level can demonstrate the following. 

1 . The principle of macroscopic locality identifies exactly the set of quantum correlations. 

2. There exist correlations observable at the macroscopic level do not admit a classical description. 

In either case, we are led to a conclusion which improves our understanding of the correlations attainable 
in nature. A consequence of macroscopic quantum correlations is the possibility of identifying physical 
processes in nature that utilize these correlations at the biological level. Recent results regarding the inner 
workings of photosynthesis and bird navigation via Earth's magnetic field suggest that it very well may be 

SO0. 

The main conclusion we draw from our work is that since NLBs are not the strongest resource for producing 
no-signalling correlations, it is not appropriate to restrict attention to only this model when concerned with 
questions regarding ultimate limits on quantum nonlocality. Indeed, we prove that the qNLB model offers 
stronger nonlocality distillability. Any protocol for NLBs can be simulated by qNLBs, whereas NLBs 
cannot simulate all qNLB protocols. We propose that questions that have so far been investigated within the 
framework of NLBs should be re-evaluated using qNLBs. 
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A Proof of Lemma [3] 

In this appendix, we prove that our Protocol [P] given in Section [3] attains the value given by Eq.[3j 
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Lemma 8 For a mixed state p = p\4>){4>\ + q\ip){ip\, where p £ [0, 1] is a probability and q = 1 
complementary probability, the following trace relations hold. 



p the 



Tr(a[ 
Trftr. 



X 

<8>n 



af n p® n ) 
a® n p® n ) 
a® n p® n ) 



(q-p) n 
1 

Tr(a? n (g 



a? n p® n ) 



0. 



Proof First consider the case n = 1, 

Tr(a z ® a z p) = q{ip\a z < 
Tr(cr x <g> a x p) = q(ip\a x 



' cr x \ 



+ p((/)\a z <g> a z 



+ 



q-p 
q+p 



1 



Tr(o- x ® a z p) = Tr(cj z (g> a x p) = q(tp\a x a x \ip) + p(4>\cr x ® cr x | 

For n > 1, using that the operators are separable, rewrite Tr(o"f n ® a®™ P® n ) = 
Pauli operators <n and 02, and apply the case n = 1. 



# = 0. 

(Tr(cri (T 2/ 9)) n for all 
□ 



We now prove Lemma[3]by substituting the appropriate expected values for each term in Equation[3] 

Proof of Lemma |3| Let Alice and Bob share n identical copies of a correlated qNLB and receive input 
bits x and y respectively. Application of Protocol [P] with observables and By yields the following 
expectation values for inputs 00, 01 and 10, 



|®n* 
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We reap the benefits of obtaining a lower value for the above inputs by obtaining a higher increase in the 
value for input 11 when < p < |. Applying Lemma [8j the expectation value for input 11 for < p < | 
is given by 



Tr (Ai Bi^ n ) = cos 2 
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af n p® n ) - sin 2 
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Equation 12 yields the value for i ^ p ^ 1 if we fix n = 1. The expression is simplified by choosing <p as 
specified in Protocol [P] and applying the following trignometric equivalence, 



cos(30) 



-2(9-p)" 
l+(q-p) n 

2f cos(0) 

1 



cos(0) if < p < \ 

\i\^p<\ 
if | < j> < 1. 
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The value V attained by Protocol [P] is obtained by substituting the expectation values 11 and 12 in Equa- 
tion [3] which gives 

V = (V| 0ri (Ao ® B + A ® Bi + Ai ® B )|V>® n - Tr(Ai B lP ® n ) 

= 3 cos (0) - i((i + ( g - cos (30) - (1 - (g - p)")). 

To complete the proof substitute Equation [13] in the expression for V and simplify to obtain 



V 



' (3 + (? - p) n ) cos(^) + |(1 - (q - p) n ) ifO<p< \ 
2{l + q) cos{(j))+p 
.2(1 +p) 



2 
2 

3 

if | ^ p < 1. 



if I < P < 



□ 



The value attained by Protocol fP\ when Alice and Bob share n identical copies of a correlated qNLB is 
strictly greater than the value attained by the optimal classical protocol for < p ^ \. To verify the claim 
we need to show that the following inequality holds f or < p ^ \ , 



3-1 < 



3 + 1 3 + 1 l-l 

where I = (q — p) n with I ranging between ^ I < 1. The inequality may be simplified to obtain, 

4(1 + /) < (3 + 0V3+T. 
If we substitute k = 3 + 1, with k ranging between 3 ^ k < 4, we obtain the inequality, 

4k - kVk-8 < 0. 

The inequality is verfied by checking that the expression on the left hand side is negative for < k < 4 and 
has roots at k equal to 4. The limit of (g — p) n as n approaches infinity is for < p ^ \. We conclude that 
Protocol jpj asymptotically distills correlated qNLBs to the value 2(3\/3 + 1) as 3.098076 for p less than a 
half. 



B Constructing the n copy SDP 

Let Alice and Bob share n identical copies of a correlated qNLB and receive input bits x and y respectively. 
Alice and Bob apply the observables A^ and B y respectively, as specified in Protocol [P] Recall that the 
value attained for the CHSH inequality is 

V = (V| 0n (A o <g> B + A ® Bi + Ax ® B o )|^) 0n - Tr(Ai ® B lP ® n ). 

We define N = 2 n + 3 vectors, one vector for each of Alice's two observables A x , one for Bob's observ- 
able Bo, and 2 n vectors for Bob's observable Bi. Let the 2 n vectors z s be indexed by a length n bit string s 
in {0, 1}™ and define X s = a* 1 ® a* 2 ® • • • ® a^ 1 so that, 

Xo = (A o ®l ")|V'> 0n 
x x = (Ai ® l (glri )|^) 0n 
y = (t® n ®B )\^)® n 
z.s = z SlS2 ... Sn = {t® n ® (X.BxX,))^)^. 



18 



Let G = [gij] be the Gram Matrix of the N vectors {xq, xi,yo, zq^, z Qn -i l , . . . , Zin}. Set W to be the 
symmetric weight matrix 



Wij = Wji 



1 if(t,i)€{(l,3),(l,4),(2,3)} 
_qn-\s\p\s\ if i = 2 and j = 3 + | s| 
otherwise, 



where |s| is the Hamming weight of the bit string s. Let s,s',t and t' be length n bit strings in {0, l} n 
such that s ^ s' and t ^ t' . Optimizing the value V attained by Protocol [P] is then equivalent to finding an 
optimal primal solution to the following SDR 

max -Tr(GW) 
G 2i 

subject to G fc= 

gu = lforalHe {1,...,N} 

P3+| S |,3+| S 'i = 93+\t\,3+\f\ if and onl y if s e s' = t e t'. 

We already encountered the first two set of constraints in the primal for the single copy SDP in Section |4] 
These constraints ensure that the matrix G is a Gram matrix and the N vectors used in its construction have 
unit norm. The new set of constraints are derived from the (2 n ~ 1 — l)(2 n — 1) inner product restrictions of 
the form z s • z a i = z t ■ Zy on the z s vectors. 

To obtain the dual, we define vector A' in R M , with M = 2 n + 3 + (2"- 1 - l)(2 n - 1), where the first 
N = 2 n + 3 components contribute to the solution value of the dual and the remaining entries correspond 
to the additional constraints. To distinguish between these two different roles we partition A' into two 
component vectors n and r such that, 

A' if 1 < i < N , f ifl <i<N 



^ ~ { o* if n < m and n ~ ) A' if V < / v :\/\ 

Define vector b E R M and let b{ = 1 for % ^ N and otherwise. Given four unique length n bit strings 
s,s',t and t', for each constraint of the form s © s' = t © t', define a matrix forN<k^ M, 



hij 



1 if i = 3 + | s | , j = 3 + | s' | such that s © s' = t © t' 
-1 if i = 3+ \ t\ , j = 3 + \t'\ such that s© s' = t@t' 
otherwise. 



The Lagrangian for the problem is given by 

1 M 
C{G,X',Z) = -Tr(Giy) + Tr(ZG)+Tr(diag(/i)-diag(^)G)- £ r k Tv{H k G) 

k=N+l 



X'-b + Tt (J^W + Z - dmg(fi) - r k H k jG^, 



where A' and Z are the dual variables. The dual function is then given by 

g(X', Z) = sup £{G, X>,Z) = { X '- b *\ W + Z ~ dia g(^) " Y%=n+i ^H k = 
v ; G I +°° otherwise. 
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The dual problem may be stated as follows, 



min A' • b 

A' 

1 M 

subject to -W + Z — diag(/x) — T kHk = 

k=N+l 

We simplify the formulation by removing variable Z and defining A = 2A' to obtain, 

min A • b 

A' 

subject to K = 2 ( diag(/x) - ^ r fc i? fc ) - W )?0. 

k=N+l 

Oppenheim and Wehner [ 28 ] have recently shown that the strength of nonlocality is related to the uncertainty 
principle and entanglement steering. The off-diagonal constraints in our SDP formulation and the value they 
take may reveal additional insights about this relationship. Violation of one of the constraints may imply 
violation of a linked uncertainty relation. In essence the constraints form a restriction on entanglement 
steering, where given Alice's measurements they restrict the states that Bob may now prepare and vice- 
versa. 

In Appendix [C] we utilize the above formulation of the dual to show that Protocol [P] is the optimal non- 
adaptive protocol for Alice and Bob when they have access to 2 or 3 copies of correlated qNLBs. 



C Optimal dual solutions for 2 and 3 copies 



The main idea we use to show optimality for the 2 and 3 copy cases, as in the single copy case is to 
break up the constraint matrix K into a sum of matrices K = Wkead + WW an d show that each matrix 
is positive semidefinite. We decompose K such that there is a fixed size 4x4 matrix Whead> while the 
matrix Wtaii has size (2 n + 1) x (2 n + 1). We begin by defining the a cut-off value x that determines the 
decomposition of K into Whead and W ta ii- 
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Define the matrix, 



head 



/ 


Ai 





-1 


"1\ 







h 


-1 


X 




-1 


-l 
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The diagonal entries Ai and A3 are exactly the first and third components of the dual solution vector A, while 
the entries l\ and I2 only have a partial contribution to the entries A2 and A4. Next we determine the diagonal 
values of Whead and show that the matrix is positive semidefinite for these values. 



20 



Lemma 9 The dual value for matrix Wh ea d is given by 



V 



(l+x) 3 



ifO<p<l 



Proof We fix Ai = A3 and l\ = I2 and choose the diagonal entries as follows, 

and l\ = 



i + i if < p < § 



if § < p ^ 1 



xAi 
1-x 



if < p < f 



if § < p < 1. 



The dual value V achieved by Whead can De calculated by summing up the diagonal entries. For < p < |, 
we obtain 



V' = Ai+xAi = + 



'l + x) 



(i + xY 



as required by Lemma [9] Similarly, for | ^ p ^ 1, sum of the diagonal entries in Whead is given by 

V' = 2 + l-x = 3-x. 
To prove that the matrix is positive semidefinite we conjugate the matrix Whead by H ® 1, 

-1 

(H <8 l)Whead(H <8 1) 



/ Ai-1 
-1 







h + x 
Ai + 1 
1 



\ 


1 

h - xj 



The matrix is positive semidefinite if (Ai — 1) (Zi + x) ^ 1 and (Ai + l)(/i — x) ^ 1. Since both these 
inequalities hold for our choice of Ai and l\, the matrix Wh ea d is positive semidefinite. □ 

Unfortunately, we do not obtain a fixed size matrix W ta n similar to Wh ea d that works for all n. To provide an 
overview of the dual constraints involved, we begin by giving a detailed construction of the W ta u matrix for 
the 3 copy case. Let | A) = \ip}® 3 . We define the vectors z s as follows, 



^000 
2001 
^010 

^001 
^100 
^101 
2110 
2111 



I 5 
I 5 

I 5 
I 5 
I 5 
I 5 



Bi|A) 

(1 <8> 1 (8) ct x )Bi(1 <8> 1 <8> cr x )|A) 
(1 <8 a x (8 l)Bi(l (8> cr x (8> 1)|A) 
(1 (8 cr x (8 (Tx)Bi(l ® cr x (8 cr x )|A) 
(cr x 1 (8 l)Bi(cr x 010 1)|A) 
(cr x 1 (8 cr x )Bi(cr x (8 1 (8 cr x )|A> 
(cr x cr x (8 l)Bi((7 x (8 cr x (8 1)|A) 
(cr x (8 <T X (8 cr x ) Bi((7 x (8 cr x (8 <r x )|A). 
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The following inner product constraints apply on the vectors z s due to their definition. These are exactly the 
additional constraints required for the 3 copy dual solution. 



^001 


£000 = 


£110 


£111 = 


£011 


£010 = 


£100 


£101 


£010 


^000 = 


£101 


£111 = 


£100 


£110 = 


£001 


£011 


£011 


£ooo = 


£100 


£lll = 


£001 


£010 = 


£110 


£101 


£100 


£ooo = 


£011 


£lll = 


£110 


£010 = 


£001 


£101 


£101 


£ooo = 


£010 


£lll = 


£001 


£100 = 


£011 


£110 


£110 


£ooo = 


£001 


£lll = 


£100 


£010 = 


£011 


£101 


Zlll 


£ooo = 


£001 


£110 = 


£101 


£010 = 


£100 


£011 



The dual constraint matrix K for the 3 copy case is given by, 



Ai 





-1 


-1 




















\ 





A 2 


-1 


Q 3 


q 2 p 


q 2 p 


qp 2 


q 2 p 


2 

qp 


qp 2 


p 3 


-1 


-1 


A 3 


























-1 


Q 3 





A 4 


























q 2 p 





Al2 + A19 + A26 


A 5 























2 

TV 





A13 + A20 + A27 


-A21 


A 6 




















qp 2 





Al4 + A21 + A28 


— A27 


— A19 


A 7 

















q 2 p 





A15 + A22 + A29 


— A23 


— A24 


— A32 


As 














qp 2 





A16 + A23 + A30 


— A29 


— A25 


— A31 


— A26 


A 9 











qp 2 





A17 + A24 + A31 


— Ais 


— A22 


— A30 


— A20 


— A28 


A10 








p 3 





A18 + A25 + A32 


— A17 


— Ai6 


— A15 


— A14 


— A13 


— A12 


An/ 



where only the lower triangular matrix is shown for the constraints. We decompose K into Whead which 
contains contribution only from the upper left 4x4 block matrix with the remaining entries contained 
in Wtaii. 

K =W head + Wtaii 





Ai 





h 


-1 
-1 


X 




















-1 


-1 


A3 





















v- 


-1 


X 





h) 




















(0 































\ 







h 





3 

q — x 


q 2 p 


q 2 p 


qp 2 


q 2 p 


qp 2 


2 

qp 


p 3 












































q 3 - x 







k 2 




























2 

q p 





A12 + 


A19 + A26 


A 5 




















+ 





2 

q p 





A13 + A20 + A27 


— A21 


A 6 






















2 

qp 





Al4 + A21 + A28 


-A27 


— A19 


A 7 



















q 2 p 





A15 + A22 + A29 


— A23 


— A24 


— A32 


As 
















2 

qp 





A16 + 


A23 + A30 


— A29 


— A25 


— A31 


— A26 


A 9 













2 

qp 





A17 + A24 + A31 


— Ais 


— A22 


-A30 


— A20 


— A28 


A10 







v° 


p3 





A18 + A25 + A32 


— A17 


— A16 


— A15 


— Ai 4 


— A13 


— A12 


An/ 



(18) 
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where A2 = l\ + k\ and A4 = 1% + k^. In the following Lemma 11 we construct specific dual solution 
matrices that satisfy the constraint matrices of the above form for 2 and 3 copies of correlated qNLBs. The 
proof of Lemma 1 1 for the case n = 3 utilizes the following generalization of Observation [5] 

Theorem 10 (Corollary 7.2.4 in [22J) Let Abe an x n Hermitian matrix, and let 

PA(t) 



4-Ti _i .n— 1 1 1 .n—m 



(19) 



be the characteristic polynomial of A. Suppose that {) ^ m ^ nanda n — m 7^ 0. Then A is positive semidefinite 
if and only if Ofar all n — m ^ k ^ n and a^ak+i < Ofor k = n — m, . . . , n — 1. Wfe define a n = 1. 



Even though the formulation of Lemma [TT] applies to the general n copy case, we prove it only for 2 and 3 

3, where x is defined 



copy case due to the complexity of the off-diagonal constraints. 

Lemma 11 The matrix Wtaii attains a dual solution value 1 — x, for n = 2 and n 



in Equation 16 



Proof First consider the case n = 2, for the range | ^ p ^ 1. The matrix 



W tail = W X -W 2 

(1 




p 







\ 






Vo 



p 



2 / 

2/ 



/ q 

q 

-q 

-g 

V-p 










-p\ 




0/ 



attains a dual solution value equal to 1 — x = p. It remains to show that W^w !>= 0. The matrix W t m\ has 
rank 2 and its row space is spanned by its first two rows. The upper left 2x2 submatrix p 



is 



positive semidefinite by Observation [5] since q ^ |. It follows that W t aii is positive semidefinite as well. 
Next consider the range < p < \, for n = 2. We define the matrix W ta ii as 

mail = Wi - W 2 



/2g 















( ° 


p 


-9 


-Q 


-p\ 





2 


P 
2 


p 

2 


2 




p 

















p 

2 


9 
2 


'/ 
2 


;j 
2 


-P 



















p 

2 


9 
2 


9 
2 


p 
2 


















\o 


<? 
2 


P 
2 


P 
2 


2/ 




V-p 











0/ 



which attains a dual solution value 2pq = 1 — x. The matrix has rank 2 and its row space is spanned by 

2q -p 



its first two rows. The upper left 2x2 submatrix ( 
p < q. It follows that Wtaii is positive semidefinite as well. 



-p 



is positive semidefinite by Observation 5 since 



For the case n = 3, we begin by removing the two zero rows and columns from the W u n matrix as specified 



in Equation 18 We further restrict columns of the same Hamming weight to be equal. This corresponds 
to equating the three cases each for weight 1 and 2 identified by the entries q 2 p and qp 2 respectively. The 
reduction in the number of constraints A allows us to consider a 5 x 5 matrix. A construction for the case 
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^ \ yields the matrix 



( p(3q 2 +p 2 ) 
-3qp 2 



\ 



q 2 p 



QP 



P 



V2 

-3qp 2 

9 g 2 p 3 



2 g 2 +p 2 

-hp 2 

3 q 2 p 3 

— <T 



■+p 2 

,2 



V3 
q 2 p 

-hp 2 

1 9 4 P 

2 g 2 +p 2 

|V 

g 2 p 2p 2 — g 2 
2 g 2 +p 2 



U 4 

qp 2 

3 g 2 p 3 
2 g 2 +p 2 
1 2 

i<?p 

1 q 2 p 3 

2 g 2 +p 2 

1 2 
jtfP 



^5 
P 3 



\ 



-hp 2 

q 2 p 2p 2 — q 2 
2 g 2 +p 2 
1 2 

p 3g 4 -4g 2 p 2 +2p 4 
2 g 2 +p 2 



7 



(20) 



The rank of WtaU ma y be further decreased by noting that v\ = 3v% + v$ and V2 
to consider the rank 3 matrix spanned by v±, V3 and 774, given by 



-3t>4. We now choose 



tail 



/p{3q 2 +p 2 ) 
\ 



2 

q p 



qp- 



2 

TP 



1 2 



QP 
1 2 
i^P 

1 q 2 p 3 

2 q 2 +p 2 



(21) 



The dual value for Wtaii in Equation 21 is given by p (3q 2 + p 2 ) which equals 1 — x. A similar construction 
for the case i < p < 1 yields the matrix 



p 

g 3 - 9 

q 2 p 

qp 2 
,3 



L'9 



V p 



qpp 



1 

2p+2 



2 q' 2 +p 2 
q 2 p g 2 +4gp+p 2 
4 q 2 +p 2 
qp q 3 +qp 2 +p 3 



3 9 

-f<TP 



«3 
2 

O 

g 2 p g 2 +4gp+p 2 
4 g 2 +p 2 

1 g 3 p 2 

2 q 2 +p 2 
1 2 

|<7 P 

g 2 p g 2 -gp+p 2 



u 4 

2 

<?P 

gp g 3 +gp 2 +p 3 
2 g 2 +p 2 
1 2 

|g p 

1 gp 4 

2 g 2 +p 2 
g 2 p g 2 — 4gp+p 2 

4 g 2 +p 2 



^5 

-f« 2 P 

g 2 p g 2 -gp+p 2 

2 g 2 +p 2 
g 2 p g 2 — 4gp+p 2 



4 

p 2 7p 3 - 



g 2 +p 2 

13p 2 +llp-3 
g 2 +p 2 
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The rank of W ta u may be further decreased by noting that v\ = 3v% + v 5 and ^2 
choose to consider the rank 3 matrix spanned by v\, V3 and U4, given by 



-2^3 — U4. We now 



tail 



/ P 

2 

q p 
\qp 2 



2 

q p 

1 g 3 p 

2 g 2 +p 2 
1 2 
i? P 



4 ( 

1 gp 4 



P 



(22) 



2 g 2 +p 2 , 



The dual value for W ta u in Equation 22 is p, which equals 1 



x. 



The fact that the Matrices 21 and 22 



are positive semidefinite may be verified by application of Theorem 10 to the characteristic polynomials of 
these matrices. □ 

The final part of our analysis constitutes the proof of Theorem [7] which is obtained by combining the dual 
solution values for both the matrices Whead an d WW. 



Theorem |7l We prove that the Protocol [P] is optimal for 2 and 3 copies by combining the dual 
>m Lemmas |9| and fill No distillation for the range \ < p $C 1 implies the dual solution values are 

3. Also, the value attained matches the value attained by Protocol fP\ and 



Proof of 

values from Lemmas |9| and [TT] 
the same for both n = 2 and n 
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is therefore tight. For | < p ^ 1, we have 



V = 3 — x + 1 — x 
= 4 - 2x 
= 2(l+p). 



For ^ < p ^ I, we obtain 



' (2-p) 3 I P 

1 — p 



v-p^y^+p 

2(2 -p) cos(» +p 

3 cos((/>) — g cos(3(/>) + p. 



For < p ^ i and n = 2, 



+ 3+(q-p) 2 l-(q-p) 2 

v 2 ;yi + ((? -p)2 + 2 

= (3 + (? - P) 2 ) cos(0) + 1(1 - (g - p) 2 ) 



= (3 + (g - p) 2 ) cos(4>) + 2pg. 
Finally, for < p ^ | and n = 3, 



/3_+^-p)3\ 3 + (q-p) 3 l-(q-pf 

v 2 ;yi + (g- P )3+ 2 

= (3 + (g - p) 3 ) cos(</>) + ^(1 - (q - pf). 



□ 



This concludes our proof of Theorem [7] that establishes qNLBs as a stronger resource for nonlocality for 
non-adaptive protocols. We have shown that if we restrict out attention to non-adaptive protocols, qNLBs 
offer improved distillation over NLBs. 
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